Skip to main content

All Questions

2votes
0answers
666views

random_state on train_test_split() appears to have large effect in performance metrics?

To summarize the problem: I have a data set with ~1450 samples, 19 features and a binary outcome where classes are fairly balanced (0.51 to 0.49). I split the data into a train set and a test set ...
jlnsci's user avatar
3votes
1answer
449views

How do I use wavelet transform for feature extraction correctly?

I'm trying to classify words based on EMG signals using a support vector machine as my model. My dataset includes 15 classes (words) with 230 repetitions and 1000 features each. I already merged all ...
Rose's user avatar
2votes
1answer
2kviews

Imbalanced data set with Sample weighting - How to interpret the performance metrics?

Consider a binary classification scenario whereby the True class (5%) is severely outbalanced to the False class (95%). My data set contains numeric data. I am using SKLearn and trying some different ...
Jurgen Cuschieri's user avatar
0votes
0answers
221views

Results of quadratic SVM in Matlab are different from the results obtained in Python

I am trying to replicate a quadratic SVM classifier from Matlab to Python, however I am having different results regarding the accuracy. In Matlab the accuracy is 0.8955 meanwhile in Python the ...
BizarroJr's user avatar
1vote
2answers
44kviews

ValueError: y should be a 1d array, got an array of shape (285, 30) instead [closed]

I am using this data set below and I am trying to find the support vector machine of the data set. Also I have my code and error below as well. http://scikit-learn.org/stable/modules/generated/sklearn....
Adil Ali's user avatar
0votes
3answers
1kviews

Why is the accuracy of a LinearSVC not the same as the SDGClassifier?

I'm fine tuning parameters for a linear support vector machine. There are multiple ways to do it, but I wanted to compare LinearSVC and SDGClassifier in terms of time. I expected the accuracy score to ...
Anna's user avatar
0votes
1answer
247views

How to include validation set in the pipeline to tune parameters for an SVM?

I have a dataset already divided into train, test and validation set. How can I insert the validation in my pipeline? Code: ...
Anna's user avatar
1vote
2answers
734views

Ways to increase recall in SVM

I am training an SVM on UCI's Bank Marketing Data Set, the bank additional-full.csv. As the data is skewed I am also interested in recall. I am getting accuracy of about 87.95% but my recall is around ...
Tanmey Rawal's user avatar
1vote
0answers
188views

Do I have to wrap multiclass SVM in OneVsRestClassifier()?

I am using an SVM for mulitclass classification between 3 labels (1,0,-1). I thought this could simply be done by using SVC(decision_function_shape = 'ovr') in my ...
Hamish Gibson's user avatar
-1votes
1answer
205views

How to select the best features for Support Vector Classification

I have a feature set that contains approximately 2 dozen features of technical analysis indicators. My own domain knowledge tells me that some of these features are better than others for predicitive ...
Hamish Gibson's user avatar
1vote
1answer
483views

In an SVM, does a more negative/positive decision score mean that it is further from the seperating hyperplane?

For example, if I have a sample with a decision score of -6 and another with a score of -3, which sample is closer to the hyperplane? Also, does the probability of a sample belonging to a class ...
MangLion's user avatar
1vote
0answers
617views

How to create an roc plot and calculate AUC for an svm (that does not return probabilities)?

I have some SVM classifier outputting final classifications for every sample in the test set, something like 1, 1, 1, 1, 0, 0, 0, 1, 0, 0, 1, 1 and so on. The "...
Gulzar's user avatar
0votes
2answers
128views

Confusion on result of K-Fold Cross Validation and Independent Test set

I am relatively new in Machine Learning. I am using Random Forest and SVM for a project. Where I did a ...
Nahid's user avatar
2votes
0answers
404views

Why are the regions/decision boundaries overlapping with multi-class classification using SVM in sci-kit?

I am using the SVM in scikit-learn library for doing multiclass classification. I am wondering why these regions (decision boundaries) are overlapping (as seen in the picture below)? Could someone ...
Prospero's user avatar
2votes
1answer
27views

When to question output of model

I'm unsure of how to ask a question without making it seem like a code review question. At what point does one question whether they've actually implemented the algorithm and-or model correctly? ...
user3727648's user avatar

153050per page
close